feat: v2 - ai assistent - debug agent by maxy-shpfy · Pull Request #2332 · TangleML/tangle-ui

maxy-shpfy · 2026-05-28T00:30:34Z

Description

Introduces a debug-assistant sub-agent that diagnoses failed pipeline runs by inspecting execution details, container state, and truncated logs. The agent is read-only and cannot mutate the pipeline spec or submit runs.

The dispatcher is refactored from a handoff-based architecture to an Agent.asTool(...) architecture, where each specialist (ask_general_help, ask_pipeline_repair, ask_debug_assistant) is exposed as a tool. This allows the dispatcher's own LLM loop to chain multiple specialist calls in a single turn — for example, calling ask_debug_assistant to identify a root cause and then ask_pipeline_repair to apply the fix, without requiring a separate user message.

pipeline-repair gains access to submit_pipeline_run so it can resubmit a run after a successful fix when the user explicitly requests it. The repair prompt is updated with a directive-based entry path: when the dispatcher passes a concrete CSOM mutation directive (e.g. derived from a debug-assistant diagnosis), pipeline-repair skips the validation-driven discovery flow and applies the targeted mutation directly.

The AgentSession now carries recentRuns, a list of the five most recent pipeline runs for the open pipeline. These are appended to the debug-assistant's system prompt at agent creation time so the model can resolve "my last run" or "the latest run" without an extra tool call.

New runTools and debugTools factories expose submit_pipeline_run, get_run_status, debug_pipeline_run, get_execution_details, get_execution_state, get_container_state, and get_container_log as agent tools. The toolBridge implements the corresponding ToolBridgeApi methods, including a composite debugPipelineRun that fetches the run, root execution state, and a truncated snapshot of every failed child execution in a single call. Payload truncation (log text capped at 8 KB, debug_info capped at 20 keys) is applied both in the bridge and in the debugTools layer to prevent large pod logs from consuming the model's context window.

The AiChatContent component fetches recent runs for the open pipeline via React Query and passes them through to the worker on each ask call. The fetchContainerLog helper is extracted into executionService and reused by both the existing logs UI component and the new bridge methods.

Observability status labels are added for the three new ask_* tool names so the status line updates correctly while a specialist is working inside a nested asTool run.

Related Issue and Pull requests

Type of Change

Checklist

I have tested this does not break current pipelines / runs functionality
I have tested the changes on staging

Screenshots (if applicable)

AI Assistant - Debug agent.mov (uploaded via Graphite)

Test Instructions

Open a pipeline that has at least one failed run.
Ask the assistant "Why did my last run fail?" — the debug-assistant should identify the failed task, cite the exit code and log excerpt, and emit a Fix to apply: directive if the cause is a single concrete input value error.
Ask "Investigate the recent failure and fix the pipeline" — the dispatcher should chain ask_debug_assistant followed by ask_pipeline_repair and return a combined diagnosis + change summary.
Ask "Investigate, fix, and rerun" — same as above but pipeline-repair should also submit a run and return the new run id.
Ask "Why did run <id> fail?" with an explicit run id to verify the agent resolves it directly without consulting the recent runs list.
Verify the status line updates during each specialist call (e.g. "Analyzing run failure..." while debug-assistant is working).
Confirm that asking a general Tangle question still routes correctly to ask_general_help.

Additional Comments

The Agent.asTool(...) migration means the agent_handoff event no longer fires from the dispatcher. The SUB_AGENT_LABELS map in observability.ts is retained for any sub-agent that uses internal handoffs, but the dispatcher's specialists are now covered by the ask_* entries under TOOL_STATUS_LABELS.

github-actions · 2026-05-28T00:30:43Z

🎩 Preview

A preview build has been created at: 05-27-feat_v2_-_ai_assistent_-_debug_agent/c45f945

maxy-shpfy · 2026-05-28T00:30:43Z

This stack of pull requests is managed by Graphite. Learn more about stacking.

camielvs · 2026-06-01T23:04:46Z

To clarify: this does not make the ai chat available on the runs page? I think there's limited use of querying a failed run in the UI without actually being able to see it, but I presume that's functionality intended to come later

camielvs · 2026-06-01T23:14:54Z

Second clarification: does this only work on pipelines with failed runs? I asked for the status of my latest run and it told me it couldn't do it. Same when I asked it to rerun a pipeline that had no issues on it.

camielvs · 2026-06-01T23:20:01Z

+  // The bridge closes over the navigation + undo stores plus the
+  // backend/auth/queryClient deps captured via refs. A single instance
+  // per AiChatContent mount is correct: each method call re-reads
+  // `navigation.rootSpec`, the active path, and the latest backend
+  // values lazily so navigation / backend changes are picked up
+  // without rebuilding the bridge.


this comment might not be needed

maxy-shpfy · 2026-06-02T19:53:35Z

Merge activity

Jun 2, 7:53 PM UTC: A user started a stack merge that includes this pull request via Graphite.
Jun 2, 8:12 PM UTC: Graphite rebased this pull request as part of a merge.
Jun 2, 8:14 PM UTC: @maxy-shpfy merged this pull request with Graphite.

This was referenced May 28, 2026

feat: v2 - ai assistant chat window #2327

Merged

feat: v2 - ai assistant web worker scaffolding #2328

Merged

feat: v2 - ai assistant llm proxy and dispatcher agent #2329

Merged

This was referenced May 28, 2026

feat: v2 - ai assistant - tangle help subagent #2330

Merged

feat: v2 - ai assistent - repair subagent #2331

Merged

refactor: v2 - ai assistant - split tools bridges #2333

Merged

maxy-shpfy force-pushed the 05-27-feat_v2_-_ai_assistent_-_debug_agent branch from 7cec3fc to 7dc9561 Compare May 28, 2026 05:42

maxy-shpfy force-pushed the 05-27-feat_v2_-_ai_assistent_-_repair_subagent branch 2 times, most recently from 21732c0 to b490705 Compare May 28, 2026 06:12

maxy-shpfy force-pushed the 05-27-feat_v2_-_ai_assistent_-_debug_agent branch from 7dc9561 to 7688762 Compare May 28, 2026 06:12

maxy-shpfy marked this pull request as ready for review May 28, 2026 06:18

maxy-shpfy requested a review from a team as a code owner May 28, 2026 06:18

This was referenced May 28, 2026

feat: v2 - ai assistant - architect subagent #2335

Merged

chore: v2 - ai assistant - unit tests #2336

Merged

maxy-shpfy added the #gsd:50790 label May 28, 2026 — with Graphite App

maxy-shpfy force-pushed the 05-27-feat_v2_-_ai_assistent_-_debug_agent branch from 7688762 to c2bd0c1 Compare June 1, 2026 16:41

maxy-shpfy force-pushed the 05-27-feat_v2_-_ai_assistent_-_repair_subagent branch 2 times, most recently from 95377e0 to c9e1443 Compare June 1, 2026 16:53

maxy-shpfy force-pushed the 05-27-feat_v2_-_ai_assistent_-_debug_agent branch from c2bd0c1 to 95746f5 Compare June 1, 2026 16:53

maxy-shpfy force-pushed the 05-27-feat_v2_-_ai_assistent_-_repair_subagent branch from c9e1443 to ecac04d Compare June 1, 2026 17:07

maxy-shpfy force-pushed the 05-27-feat_v2_-_ai_assistent_-_debug_agent branch from 95746f5 to d9d21ad Compare June 1, 2026 17:07

maxy-shpfy mentioned this pull request Jun 1, 2026

chore: v2 - ai assistant - architecture md #2344

Draft

8 tasks